On sampling error in genetic programming

نویسندگان

چکیده

Abstract The initial population in genetic programming (GP) should form a representative sample of all possible solutions (the search space). While large populations accurately approximate the distribution solutions, small tend to incorporate sampling error. This paper analyzes how size GP affects error and contributes answering question populations. First, we present probabilistic model expected number subtrees for initialized with full, grow, or ramped half-and-half. Second, based on our frequency model, that estimates given size. We validate models empirically show that, compared smaller sizes, recommended sizes largely reduce measured fitness values. Increasing even more, however, does not considerably Last, recommend some widely used benchmark problem instances result low A at initialization is necessary (but sufficient) reliable since lowering means overall random variations are reduced. Our results indicate severe GP, making obtain allows practitioners determine minimum so lower than threshold, confidence level.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

automatic verification of authentication protocols using genetic programming

implicit and unobserved errors and vulnerabilities issues usually arise in cryptographic protocols and especially in authentication protocols. this may enable an attacker to make serious damages to the desired system, such as having the access to or changing secret documents, interfering in bank transactions, having access to users’ accounts, or may be having the control all over the syste...

15 صفحه اول

Sampling Methods in Genetic Programming for Classification with Unbalanced Data

This work investigates the use of sampling methods in Genetic Programming (GP) to improve the classification accuracy in binary classification problems in which the datasets have a class imbalance. Class imbalance occurs when there are more data instances in one class than the other. As a consequence of this imbalance, when overall classification rate is used as the fitness function, as in stan...

متن کامل

Sampling Issues of Tournament Selection in Genetic Programming

Tournament selection is one of the most commonly used parent selection schemes in Genetic Programming (GP). While it has a number of advantages over other selection schemes, it still has some issues that need to be thoroughly investigated. Two of the issues are assocated with the sampling process from the population into the tournament. The first one is the socalled “multi-sampled” issue, where...

متن کامل

Progressive Sampling for Association Rules Based on Sampling Error Estimation

We explore in this paper a progressive sampling algorithm, called Sampling Error Estimation (SEE), which aims to identify an appropriate sample size for mining association rules. SEE has two advantages over previous works in the literature. First, SEE is highly efficient because an appropriate sample size can be determined without the need of executing association rules. Second, the identified ...

متن کامل

Truncation Error on Wavelet Sampling Expansions

(see [9] and [10]). Throughout this work, we assume that the function satis®es the following conditions: (i) j …x†j …cons:† jB…x†j=jxj1‡"; where " 0 and jB…x†j is bounded and 1-periodic function on R. (ii) P n2Z …n† eÿin converges absolutely to a function that has no zeros on ‰ÿ ; Š. It is known that conditions (i) and (ii) imply that fK…x; n†; n 2 Zg is a Riesz basis on V , with a unique biort...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Natural Computing

سال: 2021

ISSN: ['1572-9796', '1567-7818']

DOI: https://doi.org/10.1007/s11047-020-09828-w